Low-rank Bandit Methods for High-dimensional Dynamic Pricing

نویسندگان

Jonas Mueller

Vasilis Syrgkanis

Matt Taddy

چکیده

We consider high dimensional dynamic multi-product pricing with an evolving but low-dimensional linear demand model. Assuming the temporal variation in cross-elasticities exhibits low-rank structure based on fixed (latent) features of the products, we show that the revenue maximization problem reduces to an online bandit convex optimization with side information given by the observed demands. We design dynamic pricing algorithms whose revenue approaches that of the best fixed price vector in hindsight, at a rate that only depends on the intrinsic rank of the demand model and not the number of products. Our approach applies a bandit convex optimization algorithm in a projected low-dimensional space spanned by the latent product features, while simultaneously learning this span via online singular value decomposition of a carefully-crafted matrix containing the observed demands.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integration of dynamic pricing and overselling with opportunistic cancellation

Abstract We extend the concept of dynamic pricing by integrating it with “overselling with opportunistic cancellation” option, within the framework of dynamic policy. Under this strategy, to sell a stock of perishable product (or capacity) two prices are offered to customers at any given time period. Customers are categorized as high-paying and low-paying ones. The seller deliberately oversel...

متن کامل

Dynamic Online Pricing with Incomplete Information Using Multi-Armed Bandit Experiments

Consider the pricing decision for a manager at large online retailer, such as Amazon.com, that sells millions of products. The pricing manager must decide on real-time prices for each of these product. Due to the large number of products, the manager must set retail prices without complete demand information. A manager can run price experiments to learn about demand and maximize long run profit...

متن کامل

Dynamic Pricing under Finite Space Demand Uncertainty: A Multi-Armed Bandit with Dependent Arms

We consider a dynamic pricing problem under unknown demand models. In this problem a seller offers prices to a stream of customers and observes either success or failure in each sale attempt. The underlying demand model is unknown to the seller and can take one of N possible forms. In this paper, we show that this problem can be formulated as a multi-armed bandit with dependent arms. We propose...

متن کامل

Online low-rank + sparse structure learning for dynamic network tracking

Recent developments in information technology have enabled us to collect and analyze high dimensional and higher order data such as tensors. High dimensional data usually lies in a lower dimensional subspace and identifying this low-dimensional structure is important in many signal and information processing applications. Traditional subspace estimation approaches have been limited to vector-ty...

متن کامل

Latent Contextual Bandits: A Non-Negative Matrix Factorization Approach

We consider the stochastic contextual bandit problem with a large number of observed contexts and arms, but with a latent low-dimensional structure across contexts. This low dimensional (latent) structure encodes the fact that both the observed contexts and the mean rewards from the arms are convex mixtures of a small number of underlying latent contexts. At each time, we are presented with an ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1801.10242 شماره

صفحات -

تاریخ انتشار 2018

Low-rank Bandit Methods for High-dimensional Dynamic Pricing

نویسندگان

چکیده

منابع مشابه

Integration of dynamic pricing and overselling with opportunistic cancellation

Dynamic Online Pricing with Incomplete Information Using Multi-Armed Bandit Experiments

Dynamic Pricing under Finite Space Demand Uncertainty: A Multi-Armed Bandit with Dependent Arms

Online low-rank + sparse structure learning for dynamic network tracking

Latent Contextual Bandits: A Non-Negative Matrix Factorization Approach

عنوان ژورنال:

اشتراک گذاری